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DATA HANDLING (3) 


Learning Outcomes and Assessment Standards 


Learning Outcome 4: Data handling and probability 
Assessment Standard AS 1(a) 
Calculate and represent measures of central tendency and dispersion in univariate 
numerical data by: 
five number summary 
box and whisker diagrams 
ogives 
variance and standard deviation. 


Overview 


In this lesson you will learn about: 


e Variance 
e Standard deviation. 


Lesson 


Measures of dispersion (or spread) about the mean (ungrouped data) 


There are two measures of dispersion about the mean: 


e Variance (s?) 
e Standard deviation (s or 0) 


These measures of dispersion about the mean are highly effective since they use all 
of the data values. The range is not a good measure of dispersion because it doesn’t 
eliminate outliers, which are not normally representative of the rest of the data. The 
interquartile range is better in the sense that it focuses on the central half of the 
data. The problem with the Interquartile Range is that it is centred about the median 
and uses only a few of the data values. The mean is a far better measure of central 
tendency and the variance and standard deviation are measures of dispersion about 
the mean. These measures are far better than the others. 


We will now illustrate these two measures of dispersion by considering an example. 
Example 1(without using a calculator) 


A basketball team consists of 10 players. The number of points each player scored 
during the season are as follows: 


22 33 38 39 41 52 55 61 67 75 


Step 1. Calculate the mean. 


— | 2x _ 22+33 +38 +39 +41 +52 +55 +61 +67 +75 _ 483 _ 
0 ooo 9G = 483 


Step 2 Calculate the individual deviations from the mean. Record your results in a 


table. 
POINTS SCORED (x) x-Xx 
22 22 - 48,3 =-26,3 
33 33 - 48,3 =-15,3 
38 38 - 48,3 =-10,3 
39 39 - 48,3 =-9,3 
41 41 - 48,3 =-7,3 
52 52 - 48,3 =-3,7 
55 55 - 48,3 =6,7 
61 61 - 48,3 = 12,7 
67 67 - 48,3 = 18,7 
75 75 - 48,3 = 26,7 

a(x-x =0 


You will probably notice that the sum of all these deviations from the mean equals 0, 
which is not very helpful if we are to make conclusions about how the data is spread 
about the mean. 


Step 3 Calculate the individual squared deviations from the mean. Record your 
results in a table. Add these squared deviations. 


By squaring the deviations, we are able to eliminate the negative signs so 
that we have more useful values. 


POINTS SCORED (x) =x) (x - x) 
22 22 - 48,3 = -26,3 691,69 
33 33 - 48,3 =-15,3 234,09 
38 38 - 48,3 =-10,3 106,09 
39 39 - 48,3 = -9,3 86,49 
Al 41 - 48,3 =-7,3 53,29 
52 52 - 48,3 = -3,7 13,69 
55 55 - 48,3 =6,7 44,89 
61 61 - 48,3 = 12,7 161,29 
67 67 - 48,3 = 18,7 349,69 
75 75 - 48,3 = 26,7 712,89 
axe-¥)=0 a(x -x) ?=2454,1 


Step 4 Determine the variance (mean of the squared deviations) 


Formula1 (Grade 11 and 12 Paper 2) 


If you are working with an entire population, use: © 
> Lw-xyP 
a n 


S 


to calculate the variance. 
Formula2 (Grade 12 Paper 3) &) 


If you are working with a random sample of a population, use: rey, 
s= ime to calculate the estimated variance of the population. y ‘ 
y) 
In this examplepwe are working with a basketball team. Therefore, we will use For- 
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mula 1 to calculate the variance because we are not dealing with a random sample 

of a larger population. The basketball team is an entire population in its own right. 

variance(s?) = ze ws a = 245,41 

Step 5 Determine the standard deviation (s) by square rooting the variance. 
standard deviation = s = eee = /245,41 = 15,7 


Determine the standard deviation interval for the data. 


Step 6 
This interval is given by: (x-$;xX +5) 
For the example used, the standard deviation interval is: 
(48,3 - 15,7; 48,3 + 15,7) 
= (32,6 ; 64) 
Step 7 Draw conclusions about the spread of the data about the mean by establish- 


ing how many of the data values lie within or outside of the standard devia- 
tion interval. 


In this example, it is clear that 7 of the 10 points lie within the standard 
deviation interval (33, 38, 39, 41, 52, 55 and 61). This means that most of the 
players performed well by scoring close to the mean points score. The team- 
work was very good for this team. 


A small standard deviation indicates that the data items are clustered around 
the mean. A large standard deviation indicates that the items are more 
spread out. 


Note: 


Example GP Example 1 (using a calculator) 


The standard deviation can be easily calculated by using a calculator. We will use the 
CASIO fx-82ES to illustrate the method. The sequence is as follows: 


Push the button: MODE 
Push the button: 2: STAT 
Push the button: 1:1-VAR 
Enter the points: 22= 33= 38= 39= 41= 52= 55= 61= 
67= 75= 

Push the button: SHIFT 
Push the button: STAT 
Push the button: 5: VAR 
Push the button: 3:xon 
Push the button: = 
The answer will read: 15,7 
To clear calculator: 
Push the buttons: MODE 

> Push the buttons: 1: COMP 


Example GP 
wy 


at 
o* 
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Example 2 


Consider the following table which shows the number of learners who obtained 
certain marks out of 30 for a class test. 


Marks (x) 


18 |19 |20 |21 


22 |\23 |24 |25 |26 | 27 


No of learners 


2 |3 |/4 |6 (9 |12-16 5 [3 


Calculate the mean and standard deviation for this data. 


Solution 

Marks x Freq f fxx x-x |(x-xy f x (x -x) 
18 2 36 -5,2 [27,04 54,08 
19 2 38 -42 |17,64 35,28 
20 3 60 =39 | 1024 30,72 
21 4 84 2,2 [4,84 19,36 
22 6 132 a2 | 8,64 
23 9 207 -0,2 |0,04 0,36 
24 12 288 038 |0,64 7,68 
25 6 150 1,8 [3,24 19,44 
26 5 130 28  |7,84 39,2 
7 3 81 3,8 (14,44 43,32 
Total 52 x= US = 23,2 258,08 

The mean for this data is 23,2. 

gc RAW a = 258,08 _-.4:9630769023 (variance) 


n ~ 52 


The standard deviation is: 


ie Zils = /4,963076923 = 2,23 


Example 2 (using a calculator) 


The standard deviation can be easily calculated by using a calculator. We will use the 


CASIO fx-82ES to illustrate the method. The sequence is as follows: 


Push the button: 
Push the button: 
Push the button: 
Push the buttons: 


Scroll down and push: 


Push the button: 
Enter the marks: 


Enter the frequencies: 


Push the button: 
Push the button: 
Push the button: 
Push the button: 


The answer will read: 


To clear calculator: 
Push the buttons: 


Scroll down and push: 


Push the button: 
Push the buttons: 
Push the buttons: 


MODE 

2: STAT 
1:1-VAR 

SHIFT SETUP 

3: STAT 

1: ON 

18= 19= 
22 = 23 = 
26 = 27 = 
2= 2= 
6= 9= 
5= 3= 
SHIFT 1 

5: VAR 

3:xon 


2,23 


SHIFT SETUP 
3: STAT 

2: OFF 
MODE 

1: COMP 


pools 
Solution 


oP Example 


Activity cP Activity 1 


1. The number of points scored by four Formula One racing drivers over a 
number of races are given below: 


A|1]1 ]1 ]2 /6 {6 {8 {8 /8 |8 
B /1 |2 |6 /8 /8 /8 |8 |8 |8 - 
Cj/1/1 12 /2 |4 |4 16 16 {8 J8 - |- 
D|2 |2 |2 |4 J/4 |6 |6 |8 |8 - 
(a) Calculate the mean and standard deviation for each driver without 


using a calculator. 

(b) Now calculate the standard deviation for each driver by using your 
calculator. 

(c) Discuss the performance of each driver by referring to the standard 
deviation for each driver. 


Z: The following table contains the number of learners who obtained certain 
marks on a class test out of 30. 


Marks 
No of learners 


20 |21 
a2 


22 |23 |24 |25 |26 |27 |28 |29 
4 |5 |7 |10 |13 |5 |4 {2 


Calculate the standard deviation for this data by means of drawing a frequency table. 


Lesson Gj Lesson 


Measures of dispersion about the mean (grouped data) 


Example GP Example 3 (without using a calculator) 


Consider the example of the Health and Fitness group. Calculate the standard devia- 
tion for this set of data. 


Calculate the estimated mean: 


Class interval |Frequency | Midpoint class interval | Frequency x Midpoint 
O- 9 years 0 - - 
10 —- 19 years 11 14,5 159,5 
20 — 29 years 14 24,5 343 
30 — 39 years 17 34,5 586,5 
40 - 49 years 13 44,5 578,5 
50 —59 years 7 54,5 381,5 
G) 60-69 years 6 64,5 387 
70 — 79 years 5 74,5 372,5 
(3) 80-89 years [4 84,5 338 
90 — 99 years 0 - - 
YS L, Totals 77 = 31465 
K 3146,5 


SS Estimated mean = —~>— = 40,8 
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Calculate the squared deviations from the mean: 


Class interval | f Midp (m) |fxm |m-x (m—- x)? | f-(m - x)? 
0-9 years 0 - - - - - 
10-19 years 11 14,5 159,5 |14,5 - 40,8 =-26,3 |691,69 |7608,59 
20-29 years 14 |24,5 343 |24,5-40,8=-16,3 |265,69 |3719,66 
30-39 years 17. |34,5 586,5 |34,5-40,8=-6,3 |39,69 674,73 
40-49 years 13. |44,5 578,5 |44,5 - 40,8 = 3,7 13,69 177,97 
50-59 years Z 54,5 381,5 |54,5-40,8=13,7 |187,69 |1313,83 
60-69 years 6 64,5 387 |64,5-40,8=23,7  |561,69 |3370,14 
70-79 years 5 74,5 372,5 |74,5-40,8 = 33,7 |1135,69 |5678,45 
80-89 years 4 84,5 338 |84,5-40,8=43,7 |1909,69 |7638,76 
90-99 years 0 - - - - - 
Totals 77 |- 30182,13 


Calculate the variance: 


2 
s n 


_ Yf-(m-x) _ 30182,13 
> ~ 77 


= 391,9757143 


Calculate the standard deviation: 


2 = [Zhlm =a = 391,9757143 = 19,8 


Example 3 (using a calculator) 


mca 
Example 


The standard deviation can be easily calculated by using a calculator. We will use the 
CASIO fx-82ES to illustrate the method. The sequence is as follows: 


Push the button: 
Push the button: 
Push the button: 
Push the buttons: 


MODE 

2: STAT 
1:1-VAR 
SHIFT SETUP 


Scroll down and push:3: STAT 


Push the button: 


Enter the midpoints: 
54,5 = 
Enter the frequencies: 


6= a= 
Push the button: 
Push the button: 
Push the button: 
Push the button: 


The answer will read: 19,8 


To clear calculator: 
Push the buttons: 


Scroll down and push:3: STAT 


Push the button: 
Push the buttons: 
Push the buttons: 


1: ON 
14,5 = 
64,5 = 
11= 
4= 

SHIFT 1 

5: VAR 

3:xon 


24,5 = 
74,5 = 
14 = 


34,5 = 
84,5 = 
17 = 


44,5 = 


13 = 7= 


SHIFT SETUP 


2: OFF 
MODE - ‘ 
1: COMP y 


Activity [7 Activity 2 


1; The table represents the percentage of income spent on recreation by 50 
people. 


(a) Complete the following table for this data. 


Percentage |f |Midp(m) |fxm m-xX (m—x) f x (m - x)? 


12<p£18 |8 


18<p£24 [20 


24<p£30 |12 


30<p£36 |8 
36<p£42 {2 
Total x= 


(b) Calculate the mean for this data. 
(c) Now calculate the standard deviation for this data. 


(d) Now verify your answer by using a calculator. 
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